For designing and implementing Chinese words segmentation module in the search engine based on Nutch. this paper put forward a algorithm based on forwards maximum match algorithm (MM).
对于设计实现能够在基于nutch的搜索引擎中处理中文信息的中文分词模块,论文采用基于中文字典的正向最大匹配分词算法。
参考来源 - 基于WEB服务的空间信息专业搜索引擎的应用研究·2,447,543篇论文数据,部分数据来源于NoteExpress
The Chinese words segmentation and labeling are basis of the Chinese language processing.
汉语的分词及词性标注是汉语语言处理的基础。
Chinese words segmentation is a important work for Chinese language, processing with computer.
汉语分词是汉语言计算机处理的一项不可缺少的工作。
The segmentation of handwritten Chinese words and the classification of multi-sort financial notes images is the basis of implementation of notes automatic procession.
手写汉字分割和多类别金融票据图像分类是实现票据自动处理的基础。
应用推荐